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-continued 



T4 Ligasc 4u 
H 2 0 



final volume - 10 



^ligation reactions .ereino.bated at 12" C. for 16 

35 hours. 

Transformation of the ligation reaction products 
into INVaF competent E. coh 

a <; ,,\ nf the liRation reaction into 100 fA of 
We r^ovedMof he ug« cells were purchased 
comp etent INVoF * 1 WWJ ^ rf 500 ^ 

from Invitrogen, an • > ^ minules , beat 

p-mercaptoethanol and incubated on ^ for 

'shocked for 45 «co«b-l « J, ^ d soc ^ 

2 r Ut6 h Ift 37« a G ^ - »5 rpm for 1 hour 

and incubated at 37 C. wi reacUon on lb 

followed by plaung all the ampicillin. 
broth agar plates containing lOO/ig/rm *n. F 
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Selection of transformants 

Six colonies were K cSS^g 

ampiciUin. DNAwas ^^^t, phenol extracted, 
alkaline lysis extraction P roc< *™ 6 ? *L , H 0 Small 
fecipitati dried and ^m^^fA^ 
aliquots of each of '^^I fn " The digested DNAwas 

r."v»"^-^ * iui " ip8 " ,h 

transfections. 

COS7 transfection 
• ii transection was carried out according 
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HIGH LEVEL OF EXPRESSION OFINGAP m is A \* s*r , a 
B IN BACTERIAL ANI^TORARYOT^ CELLS * H K [Q I ( C 

This is a continuation-in-part application of U.S. Ser. No 



This is a continuation-in-part application of U.S.Ser No • j fcefad* 

08/741,096 filed Oct 30, 1996, now abandoned ^ ^ Q±s t ^ ^ ,p £ ~ I SS"^ *-p (> L ' 

^j)^ mvent i° D k related to methods and constructs for ^ ) 



yy 08/709,662, and 60/006,271. The disclosure of these appli- 

yj cations is expressly incorporated herein. Despite the knowl- 

J" 5 edge of the complete nucleotide sequence of the INGAP 

gene, expression of the protein has been limited. Thus there 35 

ypl is a need in the art for methods of expressing and isolating 

3 large quantities of the INGAP protein, especially in eukary- 

^ otic systems. 

%fi _ , SUMMARY OF THE INVENTION 40 

%xJ g I It is an object oftjte present invention to provide a method 

fQ of producing bipfcgically active INGAP protein from a 

f4 ^ . recombinant h^st cell. * 

40 x. It is another object of the present invention to provide a 45 
^ host cell which expca&es large amounts of INGAP protein. 

^3 It is an object^pi-^he present invention to provide a 
recombinanj^rlslruct for expression of biologically active 
£ >. INGAP^tfotein. . 

gtf Another object ofthe invention is to provide a method for 50 
isolating INGAI^protein from a recombinant host cell. 
* gi~ These and other objects of the invention are achieved by 
providing the aruvith a recombinant construct for expres- 
sion of biologically active INGAP protein comprising: 

a first nucleotide sequence encoding amino acids 27 to 
175 SEQ ID NO: 6 operably linked to a transcriptional, 
initiation site and a translational initiation site, wherein a 
second nucleotide sequence encoding a signal peptide is not 
present immediately 5' of said first nucleotide sequence. ^ 

In another embodiment of the invention a method of 
producing INGAP activity from a recombinant host cell is 
provided. The method comprises the steps of: 

culturing a host cell comprising a recombinant construct 
comprising a first nucleotide sequence encoding amino acids 65 
27 to 175 SEQ ID NO: 6 operably linked to a transcriptional 
initiation site and a translational initiation site, wherein a 



achieving high level expression of INGAP, a protein 
involved in islet cell neogenesis. 

BACKGROUND OF THE INVENTION 

Pancreatic islets of Langerhans are the only organs of 
insulin production by p cells in the body. However, they is 
have a limited capacity for regeneration. This limited regen- 
eration capacity predisposes mammals to develop diabetes 
mellitus. Thus there is a need in the art of endocrinology for 
products which can stimulate the regeneration of islets of 
Langerhans to prevent or ameliorate the symptoms of dia- 20 
betes mellitus. 

There are many factors regulating pancreatic p cell mass. 
(Vinik, et al., Diabetes Reviews 4: 235-263, 1996.) A 
pancreatic extract called ilotropin induces p cell regenera- 
tion and reverses diabetes. (Rosenberg et al. (1996) Diabe- 25 
tologia 39: 256-262. A gene encoding a protein within 
ilotropin has been identified and isolated; the protein is 
responsible for stimulating islet cell regeneration. 
(Rafaeloff, R. Journal of Clinical Investigations 99: 
2100-2109, 1997.) This protein is called INGAP, and is 30 
disclosed in patent applications Ser. Nos. 08/401,530, 



hi 



LJ 



10 



2 

second nucleotide sequence encoding a signal peptide is not 
present immediately 5' of said first nucleotide sequence, 
recovering protein from said cultured host celL 
In yet another embodiment of the invention a host cell is 
provided. The host cell comprises a recombinant construct 
comprising a first nucleotide sequence encoding amino acids 
27 to 175 SEQ ID NOyo operably linked to a transactional 
initiation site and a/u-anslational initiation site, wherein a 
second nucleotide ysJequence encoding a signal peptide is not 
present immediately 5' of said first nucleotide sequence. 
These and othe^smbodiments of the invention which will be 
apparent to th*5se of skill in the art provide a practical source 
of INGAP protein in amounts suitable for use in preclinical 
and clinical situations. 

15 ^ 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1. SDS-PAGE gel of products of bacterial transfec- 
tion. Bacterial lysate without transfection (CBL), bacterial 
20 lysate with transfection (TBL), fractions from Ni-NTA chro- 
matography (eluted at pH63 (6.3); pH 5.9 (5.9); and pH 4.5 
(4.5) and standards (Std). 

FIG. 2. ECL film of Western blot using INGAP antibody 
945-2. Lanes are as identified in the description to FIG. 1. 

25 
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DETAILED DESCRIPTION 

It is a discovery of the present inventors that bacterial 
expression as well as eukaryotic expression of INGAP can 
be achieved at high levels by deleting the coding sequence 
of the signal sequence of INGAP. While not wanting to be 
bound by any particular theory or mechanism of action, 
applicants believe that the signal sequence is toxic to host 
systems. The signal sequence comprises amino acids 1 to 26 
as shown in SEQ ID NO: 5. In the constructions tested, the 
5' untranslated region comprising nucleotides 1-16 SEQ ID 
s NO. 1 has also been deleted. This deletion may also con- 

£3 tribute to the increase in expression which has been 

.j4 observed. 

40 Applicants have found that an inducible transcription 
~ initiator is exceedingly useful for INGAP expression in 

iy prokaryotic systems. Suitable inducible transcription initia- 

Q tors include the lac promoter/operator, the lac promoter, the 

trp promoter, the Kcl promoter, the let promoter, as well as 
45 others which are known in the art. 

According to another aspect of the invention, a histidine 
tag can be put on the protein. The histidine tag can simplify 
processing and purification. A histidine tag is a stretch of 
histidine residues which is appended to a protein, usually by 
50 genetic engineering. Preferably the tag comprises between 3 
and 12 histidine residues. They may be contiguous or 
interrupted by other residues. The histidine tag may be 
appended to the N- terminal or to the C-terminal end of the 
protein to minimize disruption of protein function. Methods 
55 for making and utilizing histidine tags are known in the art. 
The oligohistidine can be used as an affinity moiety using a 
metal chelate, such as nickel-NTA (N-(5-amino-l- 
carboxypentyl)-iminodiacetic acid) as the other affinity part- 
ner. 

60 A recombinant construct according to the invention, is 
any DNA molecule which has been engineered so that two 
segments of DNA are adjacent to each other which are not 
adjacent to each other in nature. Preferably such engineering 
is performed in vitro, although in vivo engineering can also 
65 be performed. The construct may be a plasmid, phage, virus, 
transposable element, minichromosome, or other element, as 
is suitable for the desired application. 
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In a preferred embodiment, the construct for INGAP 
expression in eukaryotic systems contains an origin of 
replication, e.g., EBV oriP, which permits extrachromo- 
somal maintenance in primate and canine cell lines. A 
sequence element which encodes a nuclear antigen and 5 
allows high level episomal replication, e.g., EBNA-1 or 
O ^SV40 T antigen is also desirable. 

The coding sequence of amino acid residues 27-175 of 
& INGAP protein areJncluded in the constructs. Preferably the 
entire signal sequence is deleted. However, it is possible that 10 
only a portion 4i the signal sequence must be deleted to 
obtain excellent expression. Thus some portion of the signal 
sequence miant be retained in the constructs. 

Deletion of the 5' untranslated region, nucleotides 1-16, 
is also desirable. However, it is not known if this is neces- 15 
sary to achieve excellent expression. Thus the 5' untranslated 
region may be retained in some constructs without departing 
from the spirit of the invention. 

A host cell according to the invention can be transfected 
or transformed with a recombinant construct according to 
the present invention. The host may be a bacteria, yeast, 
insect, or mammalian cell. For eukaryotic expression of 
INGAP, any cell lines suitable for protein expression may be 
used, including COS-7 cells and CHO cells. 25 

Selection of suitable promoters and translational initiators 
for use in the appropriate host cell is well within the ability 



20 



of those skilled in the art. For eukaryotic expression system, ^ 
it is exceedingly useful to choose a promoter sequence 
which is capable of initiating constitutive transcription to 
achieve constitutive high level expression of the protein. 
Rous sarcoma vims long terminal repeat (RSVLTR) is an 
example of such promoter, although others as are known in 45 
the art can be used. 

Host cells may be transformed, transfected, mated or 
infected with the recombinant host cell of the present 
invention. Culturing of host cells can be performed using 50 
techniques and media which are well known in the art. 
Again, a suitable medium and technique can be selected by 
an ordinary skilled artisan. 

The above disclosure generally describes the present 
invention. A more complete understanding can be obtained 55 
by reference to the following specific examples which are 
provided herein for purposes of illustration only, and are not 
intended to limit the scope of the invention. 

EXAMPLE 1 

60 

This example describes the experimental design 
employed. 

We generated a new INGAP cDNA by PCR which 
excluded the 5' UTR region (nucleotides 1-16 in SEQ ID: 1) 
and nucleotides encoding the signal peptide (nucleotides 65 
17-94 SEQ ID NO: 1) and created two new restriction 
enzyme recognition sites enabling the insertion of the new 
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construct into a new pQE-31 expression vector. This new 
ligaled construct was transformed into TO PI OP competent 
cells (£. coli host strain from lnvitrogen). The positive 
clones were identified, verified by restriction enzyme diges- 

5 lion and the DNA isolated. The DNA was now transformed 
into a different competent E. coli strain, M15(pREP4) and 
expression of the protein was induced by IPTG (isopropyl- 
beta, D-thiogalactopyranoside) which inhibits the repressor, 
facilitating expression of the protein from the M15 

10 promoter/operator. The reason for the intermediate transfor- 
mation of the ligated materia] into TOP10F 1 is that these 
cells are highly competent increasing the odds of getting 
insert positive colonies. The M15(pREP4) cells that were 
used for protein expression do not attain competency levels 

15 high enough to guarantee transformation of the ligation 
products. The resultant plasmid DNA obtained from the 
transformation of the TOP10F was sufficient to enhance the 
transformation of the M15(pREP4) cells. The His-lagged 
protein was isolated by Ni +2 agarose affinity purification. 

20 We used a PCR approach to generate a new INGAPcDNA ^ 
which excludes the 5' UTR region (nucleotides 1-16 in SEQ 
ID NO: 1) and nucleotides encoding the signal peptide. 

The nucleotide sequence (SEQ ID NO: 1) and cone- l 
25 sponding amino acid sequence (SEQ ID NO: 5) that have \ 
been excluded are as follows: (the bolded area represents the 
sequence of the signal peptide) 

CTGCAAGACAGGTACC ATG ATG CTT CCC ATG ACC CTC TGT AGG 
MET MET Leu Pro MET The Leu Cys Arg 
ATG TCT TGG ATG CTG CTT TCC TGC CTG ATG TTC CTT TCT TGG 
MET Ser Trp MET Leu Leu Ser Cys Leu MET Pfae Leu Ser Trp 
GTG GAA GGT 
Val Glu Gly 



w EXAMPLE 2 

This example describes the use of polymerase chain 
reaction to synthesize INGMAT (a construct which lacks the 
45 signal peptide sequence, i.e., which encodes the mature 
protein). 

Oligonucleotide design: 

50 Oligonucleotides for PCR were designed to incorporate 
restriction enzyme recognition sites at their respective 5 1 
ends. The oligonucleotide designed for the 5' end of the gene 
incorporates a Bam HI site followed by 20 nucleotides 
corresponding to the N-tcrminus of the mature protein. The 

55 oligonucleotide designed for the 3' end incorporates an Xho 
I site followed by 20 untranslated nucleotides. The PCR 
product generated from these primers contains the mature 
INGAP sequence and the native protein termination codon. 

60 The following is the sequence of the oligonucleotides 
used: 

5* of INGAP (SEQ ID NO: 2) 

S'-CCGCGGATCCCGAAGAATCTCAAAAGAAACr-T 
65 3' of INGAP (SEQ ID NO: 3) 

S'-GACCGGCTCGAGTGCTCTTCCTGAGTGAATCC^' 



PCR of INGMAT 
Reaction conditions 



Template: (50 ng INGAP original cDNA 

removed from pCDNA3) 

MgQ 2 : 

10 X PCR buffer 



4/d 
M 

M 
M 

M 
29/zl 

M 



dATP 
dCTP 
dGTP 
dTTP 
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5' primer 
3' primer 
H 2 0 

Taq polymerase 



15 



total volume - 50 /d 



cycle parameters 

A) 2 min at 95° C. 

B) 30 cycles of (1 min 95° C, 1 min 55° C, 1 min 72° 2 ° 
C) 

Q 7min at 72° C. 

D) 4° C. until removed from thermal cycler. 

Hie PCR products were then electrophoresed on a 5% 25 
PAGE in TBE Ethidium bromide stained PCR products 
corresponding to the expected size for the construct were cut 
from the gel. The gel fragments were electro-eluted into 0.5 
ml of TBE, precipitated with 50 /4 3M sodium acetate and 
1 ml of isopropanol at -80° C. for 20 min, centrifuged, 30 
washed once with 1 ml of isopropanol, washed once with 1 
ml of 70% ethanol, and then dried under vacuum. The drie<T) 
pellet was resuspended in 50 /d H 2 0 and quantified. At the I 
end of this step the sequence of the PCR product that % 
contains both restriction sites minus the signal sequence and J 
5' UTR was as follows (SEQ ID NO: 4): 



5 -CC GCG 


GAT 


CCC 


GAA 


GAA 


TCT 


CAA 


AAG 


AAA 


CTGCCT 




TCT 


TCA 


CGT 


ATA 


ACC 


TGT 


CCT 


CAA 


GGC 


TCT 


GTA 


GCC 


TAT 


GOG 


TCC 


TAT 


TGC 


TAT 


TCA 


CTG 


ATT 


TTG 


ATA 


CCA 


CAG 


ACC 


TGG 


TCT 


AAT 


GCA 


GAA 


CTA 


TCC 


TGC 


CAG 


ATG 


CAT 


TTC 


TCA 


GGA 


CAC 


CTG 


GCA 


TTT 


CTT 


CTC 


AGT 


ACT 


GGT 


GAA 


ATT 


ACC 


TTC 


GTG 


TCC 


TCC 


CTT 


GTG 


AAG 


AAC 


AGT 


TTG 


ACG 


GCC 


TAC 


CAG 


TAC 


ATC 


TGG 


ATT 


GGA 


CTC 


CAT 


GAT 


CCC 


TCA 


CAT 


GGT 


ACA 


CTA 


CCC 


AAC 


GGA 


AGT 


GGA 


TGG 


AGG 


TGG 


AGC 


AGT 




TCC 


AAT 


GTG 


CTG 


ACC 


TTC 


TAT 


AAC 


TGG 


GAG 


AGG 


AAC 


CCC 


TCT 


ATT 


GCT 


GCT 


GAC 


CGT 


GGT 


TAT 


TGT 


GCA 


GTT 


TTG 


TCT 


CAG 


AAA 


TCA 


GGT 


TTT 


CAG 


AAG 


TGG 


AGA 


GAT 


TTT 


AAT 


TGT 


GAA 


AAT 


GAG 


CTT 


CCC 


TAT 


ATC 


TGC 


AAA 


TTC 


AAG 


GTC 


TAG 


GGC 


AGT 


TCT 


AAT 


TTC 


AAC 


AGC 


TTG 


AAA 


ATA 


TTA 


TGA 


AGC 


TCA- 


CAT 


GGA 


CAA 


GGA 


AGC 


AAG 


TAT 


GAG 


GAT 


TCA 


CTC 




AGG 


AAG 


AGC 


ACT 


CGA 


GCC 


GGT 


C-3' 













♦The bolded areas represent the primers. 
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EXAMPLE 3 



This example describes the creation of a plasmid contain- 
ing the expression construct. 

5 

Restriction enzyme digestion of the INGMAT PCR 
product and the pQE-31 vector 

We performed two parallel restriction enzyme digestion 

10 reactions using 2.5 ftg of both the INGMAT PCR product 
and pQE-31 vector. INGMAT was digested with Bam HI 
and Xho I simultaneously in a 30 jA volume. PQE-31 was 
digested with Bam HI and Sal I simultaneously in a 30 /d 
volume. Both digestion reactions were carried out at 37° C. 

15 for a period of 4 hours. After the reactions were completed, 
400 ng of each was electrophoresed on a 15% agarose gel 
and stained with ethidium bromide to assure complete 
digestion. The remainder ("2.1 ug) of both digestion reac- 
tions were passed over a seph arose G-50 to remove the small 

20 DNA fragments followed by two equal volume phenol 
extractions. The extracted DNA was then precipitated with 
2 volumes of ethanol and Yio volume 3M sodium acetate at 
-80° G. for 20 minutes, centrifuged, washed twice with 70% 
ethanol and dried under vacuum. The pellets were resus- 

25 pended in 25 fx\ H 2 0 and quantified. 

The pQE-31 expression system was purchased from 
QIAGEN Inc. Chatsworth, Calif. 

30 Ligation of INGMAT into pQE-31 

INGMAT (Bam Hl/Xho I) and pQE-31(Bam Hl/Sal I) 
have compatible ends suitable for ligation. As a result of the 
ligation the Sal I restriction site in the vector will be 
35 eliminated. 

Ligation conditions using a 2:1 vector to insert molar 
ratio. 



pQE31 (vector) 517 ng 
IN G MAT (insert) 165 ng 
10 X ligation buffer 
10 mM rATP 
T4 Ligasc 4u 
H 2 Q 



9/d 
2.5 fi\ 
5/d 

SfA 
M 
27.5 //I 



5 



final volume - 50 /il 



10 

The ligation reactions were incubated at 4° C. for 16 
hours. 

Transformation of the ligation reaction products 

into TOP10F competent E. coli 15 

We removed 5 /J of the ligation reaction into 100 ^1 of 
competent TOP10F cells, (TOPIOF cells were purchased 
from Invitrogen, San Diego, Calif.) with 0.5 fi\ of 500 mM 
p-mercaptoethanol and incubated on ice for 30 minutes, heat 
shocked for 45 seconds at 42° C, and recovered on ice for 20 
2 minutes. Then we added 1 ml of prewarmed S° C. media 
and incubated at 37° C. with shaking at 225 rpm for 1 hour 
followed by plating all the transformation reaction on LB 
broth agar plates containing 100 ^g/ml ampicillin. 



Colony containing plates were lifted onto Nytran mem- 
branes. The colonies were lysed with 0.5M NaOH, 
neutralized, and the resultant DNA bound to the membrane 30 
by baking at 80° C. for 1 hour. The membranes were then 
hybridized in 50% formaldehyde, 5xSSPE at 50° C for 16 
hours with 3xl0 6 cpm/ml of 32 P random primed INGAP 
cDNA. The membranes were washed at high stringency and 
exposed to X-ray film. Positive colonies were matched up to 35 
the X-ray film and grown up in 3 mis of LB with ampicillin. 

DNA isolation from positive transformants 

DNA was isolated from the small cultures using alkaline 
lysis, phenol extracted, precipitated, dried, and resuspended 40 
in 50 ,ul H 2 0. A small aliquot of each of the isolated DNA 
were digested with Bam HI and Hind III to release inserts. 
The digested DNAs were electrophoresed on 1.5% agarose 
and stained with ethidium bromide and positive inserts 
identified at approximately 510 bp size range. We took four 45 
of the insert containing plasmids and incubated them in the 
presence of RNAse to remove any residual bacterial RNA. 

Transformation of the ligation products into MIS 

(pREP4) competent E. coli so 

We removed 5 /d of the cleaned DNA isolated in section 
HE and transfer it into 100 jd of M15(pREP4) competent 
cells. The mixture was incubated on ice for 30 minutes, heat 
shocked for 45 seconds at 42° C, and recovered on ice for 55 
2 minutes. 1 ml of prewarmed SOC media was added and 
incubated at 37° C with shaking at 225 rpm for 90 minutes. 
All of the transformation reaction was plated on LB broth 
agar plates containing 100 /ig/ml ampicillin and 25 #g/ml 
kanamycin. 60 

Selection of transformants for INGMATHIS 
(INGMAT plus a six-histidine tag) protein 
production 

Eight colonies were picked and grown up in LB with 65 
ampicillin. DNA was isolated from the small cultures using 
alkaline lysis extraction procedures, phenol extracted, 



Selection of transformants 
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precipitated, dried, and resuspended in 50 /d H 2 0. A small 
aliquot of each of the isolated DNAwere digested with Bam 
HI and Hind III to release inserts. The digested DNA was run 
on 1.5% agarose gel and visualized by staining with 
5 ethidium bromide. Several of the transformants which dem- 
onstrated the plasmid with inserts of the correct size as well 
as the presence of the pREP4 plasmid were stored in 50% 
glycerol at -80° C. to be used for protein production. 

10 EXAMPLE 4 

This example describes denaturing metal affinity protein 
chromatography isolation of the his tagged INGAP protein 
without signal peptide. (Procedure for a 250 ml pING- 
MATHIS transformed M15 (pREP4) culture. pINGMATHIS 
15 is the INGMATHIS construct ligated into the pQE-31 
vector.) 

Bacteria growth and protein induction 

We grew a 25 ml overnight in LB with 100 g/ml ampi- 
20 cillin and 25 ^g/ml kanamycin antibiotic. We started a 250 
ml LB plus 100 ^g/ml ampicillin and 25 ^g/ml kanamycin 
culture with 5 ml of the overnight. (1:50) Grown until 
ABS600-0.0.75 to 0.9 (actual OD-0.866). Added 5 ml of 
100 mM IPTG (2 mM final) to induce production of the 
25 protein. Continue growing for 4 hours in the case of INGAP. 
Collected the bacteria and spin at 6000 rpm for 20 minutes, 
discarded the supernatant. The pellet was frozen until ready 
to use at -70° C. 

3Q Ni +2 NTA agarose preparation 

Prepare as much as will be needed. (Use 10 ml of the 50% 
Ni* 2 NTA for each 250 ml derived bacterial pellet). Place 16 
ml of the 50% slurry into a disposable 50 ml centrifuge tube. 
Centrifuge for 2 minutes at 800xg and discard the supema- 

35 tanl. Add 42 ml of sterile water, resuspend the resin. Cen- 
trifuge for 2 minutes at 800xG and discard the supernatant. 
Add 42 ml of sterile water, resuspend the resin. Centrifuge 
for 2 minutes at 800xG and discard the supernatant. Add 42 
ml of binding/lysis buffer A (6M Guanidine HC1, 0.1 M 

40 sodium phosphate, 0.01M Tris, pH 8.0) and resuspend the 
resin. Centrifuge for 2 minutes at 800xG and discard the 
supernatant. Add 42 ml of binding/lysis buffer A (6M 
Guanidine HCL, 0.1M sodium phosphate, 0.01M Tris, pH 
8.0) and resuspend the resin. Centrifuge for 2 minutes at 

45 800xG and discard the supernatant. Add 42 ml of binding/ 
lysis buffer A (6M Guanidine HCL, 0.1M sodium phosphate, 
0.01M Tris, pH 8.0) and resuspend the resin. Centrifuge for 
2 minutes at 800xG and discard the supernatant. Bring the 
total volume up to 10 ml with buffer A. The slurry is now 

50 ready for the application of the lysed bacteria. 

Bacteria lysis and protein isolation 

Thaw the bacterial pellet for 15 minutes at room tem- 
perature. Resuspend the pellet in 12^ ml of lysis buffer A. 

55 (6M Guanidine HCL, 0.1M sodium phosphate, 0.01M Tris, 
pH 8.0). Transfer the resuspension to a 50 ml centrifuge 
tube. Freeze the resuspension/lysate at -70 until solid. Thaw 
at room temperature. Place the lysate on a rotator for 60 
minutes at room temperature. Centrifuge the lysate for 15 

60 minutes at 10,000xG. Collect the supernatant and add the 10 
ml of prepared Ni2+NTA. Rotate for 45 minutes. Load the 
slurry onto a 1.6 cm diameter column and allow to flow 
through by gravity. 

Washes 

65 

Flow through 50 ml of buffer A. (No need to collect.) 
Flow through 40 ml of buffer B (8M Urea, 0.1M Sodium 



phosphate, 0.01M Tris, pH 8.0). (No need to collect.) but 
should be at or near zero before continuing, if not, then 
wash with more. Wash through 40 ml of buffer C, same as 
B but pH 6.3. Collect 3 ml fractions. Wash through 40 ml of 
buffer D, same as B but pH 5.9. Collect 3 ml fractions. 5 

Wash through 40 ml of buffer E, same as B but pH 4.5. 
Collect 3 ml fractions. At this point the protein should be in 
one of the fractions taken. Read the absorbance at 280 of all 
the fractions to discern where the protein is. Pool, reduce, 10 
and SDS page electrophoresis as necessary. 



Dialysis 

In order to purify the expressed protein, we changed the 15 
carrier solution of the fraction extracted from the nickel/ 
NTA at pH 4.5 to Tris buffer using dialysis. Dialysis tubing 
with a molecular weight cut-off of 3000 was prepared by 
boiling in 5 mM EDTA/200 mM sodium bicarbonate for 5 
minutes. The tubing was rinsed briefly in deionized water 20 
and boiled another 5 minutes in the bicarbonate solution. 
The tubing was returned to deionized water, covered with 
aluminum foil and autoclaved for 10 minutes on a liquid 
cycle. The tubing was handled with latex gloves during the 
entire procedure. 25 

One ml of the protein solution from the nickel/NTA 
column in 6M guanidine HC1 was dialyzed against 4 liters 
of 25 mM Tris buffer at pH 8.5 for 12 hours. After dialysis, 
there were 2 mis of protein solution with a protein concen- 
tration of 800 ug/ml. 30 



EXAMPLE 5 
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45 



50 



This example describes analytical techniques confirming 
the identity of the product 

^-^^ SDS-PAGE 

In order to test for the oWi-expression of the INGAP 
protein, discontinuous denatimng polyacrylamide gel elec- 
trophoresis was performed 0^ the dialyzed protein solution 
using the Hoefer SE250 Mighty Small II apparatus. The 
separating gel was prepared with 15% acrylamide, 1.35% 
bis-acrylamide in 375 mM Tris buffer at pH 8.8 with 0.05% 
sodium dodecyl sulfat/ Polymerization was induced by 
addition of 0.05% amnfonium persulfate and 20 /d TEMED/ 
15 ml solution. The Solution was placed in the gel plate 
apparatus for polymerization. The stacking gel was poured 
with the same solution, except the Tris buffer was 125 mM 
at pH 6.8, and thi acrylamide concentration was 4%. The 
protein samples ywere diluted 1:1 with sample buffer (125 
mM Tris-Cl, pH 6.8, 4% SDS, 20% glycerol, and 10% 
2-mercaptoeth/nol). 

The upper and lower tank buffers were identical, contain- 55 
ing 25 mM Tris, 192 mM glycine and 0.1% SDS at pH 8.3. 
Two gels were loaded with 20 fA each of bacterial lysate 
without transfection (CBU368 ug/ml), bacterial lysate with 
transfection CTBL, 341 ug/ml), the fractions from Ni-NTA 
chromatography (eluted at pH6.3, 110 ug/ml; pH 5.9, 100 60 
ug/ml; and pH 4.5, 800 ug/ml) and standards (Rainbow 
Markers, Amersham and Dalton Mark- VII, Sigma). Elec- 
trophoresis was performed at 20 mA constant current until 
the dye front entered the separating gel, and at 60 mA 
constant current until the dye front reached 0.5 cm from the 65 
bottom. The gels were then removed and one was fixed with 
45% methanol/10% acetic acid for one hour, and the other 
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was placed in transfer buffer (25 mM Tris, 192 mM glycine, 
20% methanol, pH 83) for 20-30 minutes. 

Silver Staining 

The fixed gel was equilibrated with 2 changes of 10% 
ethanol/5% acetic acid for 30 minutes each. The gel was then 
exposed to a 0.0032N HN03/K2Cr207 solution for 5 min- 
utes. The gel was washed in deionized water 3 times for 10 
minutes each. The gel was impregnated with silver using 0.1 
g AgNOj/50 ml H 2 0 for 30 minutes. The silver solution was 
washed off the gel in deionized water for 5 minutes. The gel 
was then exposed to a developer solution (29.7 g anhydrous 
15 Na2C0 3 in 1 liter H 2 0 with 0.5 ml formalin) in 5 minute 
intervals between changes until the desired density was 
reached. The development was stopped with 10% acetic 
acid, and the gel stored in H 2 0. 

The gel showed a pnftein band of approximately 19 kD 
^ that was prominent inrthe bacterial lysate from transfected 
cells and in the thxtian fraction from pH 4.5 on nickel/NTA 
(FIG. 1). This protein was not represented in any of the other 
samples. This is consistent with the size of INGAP protein 
^ and with interaction of the inserted histidine tagging region 
with the nickjJ/NTA column matrix. 

Western Blotting 

30 Immobilon-P PVDF membrane was wetted with 100% 
methanol, and equilibrated with transfer buffer for 10 min- 
utes. The gel was removed from transfer buffer and placed 
on the PVDF membrane. All bubbles between the membrane 
and the gel were removed. The combination was placed 
35 between Whatmann 3 mm filter paper weUed with transfer 
buffer and the whole "sandwich" was placed in the cassette 
of a Hoefer transfer tank. The cassette was placed in the 
transfer tank filled with transfer buffer with the gel toward 
the cathode. The transfer was performed at 12V constant 
40 voltage for 18 hours. 

After transfer, the membrane was placed in a blocking 
buffer of 0.5M Tris, 2M NaCl and 1% polyethylene glycol 
with 5% bovine serum albumin and 10% goat serum at room 
temperature for 1 hour. The membrane was then placed into 
20 ml of blocking buffer containing INGAP antibody 945-2 
at a dilution of 1:5000 and incubated at room temperature for 
1 hour. The membrane was then washed 3 times for 15 
minutes each with 50 ml of washing buffer (0.4% Tween-20 
in phosphate-buffered saline (PBS) at pH 7.4). The mem- 
brane was then incubated for 1 hour at room temperature in 
washing buffer containing anti-rabbit IgG (whole molecule, 
Sigma Cat # A-0545) peroxidase conjugate at a 1:160,000 
dilution. The membrane was washed 3 times for 5 minutes 
in 50 ml of 0.2% Tween-20 in PBS, followed by 3 washes 
of 5 minutes each with 0.1% Tween-20 in PBS. The blot was 
revealed using the enzyme chemtiuminescence kit from 
Amersham Corp., Arlington, EL according to instructions. 
The ECL blot was exposed to Kodak X-Omat AR-5 X-ray 
. film for 20 minutes. 

ECL of the blot revealed strong protein recognition of the 
J>/0 overexpressed 19 kD proteins in the whole lysate from 
^ transfected bacteria (IBL) arfd the pH 45 fraction that were 

visualized on the SDS-PAOE gels (FIG. 2). In addition, there 
65 was a protein band recognized in both bacterial lysales at 40 
kD, implying that this protein is weakly recognized and is a 
bacterial protein rather/than a product of the transfection. 
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Finally, there was a light band at 14 kD recognized by the 
antibody in both the transfected bacterial lysate and in the 
pH 4.5 fraction. This may either be another protein or a lytic 
fraction of the INGAP protein. Given the engineering done 
to produce the INGAP protein it is most likely a lytic 5 
^ iraction of INGAP. 

^t^L* In summary, we have been/able to express INGAP protein 
*t in a prokaryotic system by excluding the 5' UTR and the 
signal peptide and insertion of the new construct into a new 
vector. Toe resultant p/otein is of the predicted molecular 10 
size of INGAP monomer and reacts with the antibody to 
INGAP in a Weston analysis. The protein shares with 
INGAP peptide ty ability to induce ductal cell proliferation. 

EXAMPLE 6 15 

This example describes the experimental design 
employed for INGAP expression in eukaryotic systems. 

We generated an INGAP cDNAby PCR which excluded 
the 5' UTR region (nucleotides 1-16 in SEQ ID: 1) and 20 
nucleotides encoding the signal peptide (nucleotides 17-94 
SEQ ID NO: 1). The reason for excluding the 5' UTR region 
was to create a protein that is similar to the native protein in 
which the 5' UTR is not part of the protein. We also created 
two new restriction enzyme recognition sites enabling the 25 
insertion of the new construct into a new pEBVHis-B 
eukaryotic expression vector. This new ligated construct was 
transformed into INVaP competent cells (E. coli host strain 
from Invitrogen). The positive clones were identified, veri- 
fied by restriction enzyme digestion and the DNA isolated 30 
and transfected into COS-7 cells. The His-tagged protein 
was isolated by Ni +2 agarose affinity purification. The iso- 
lated protein showed biological activity when used to stimu- 
late proliferation of ARIP (ductal) cells as measured by 
3 H-TdR incorporation. 35 

We used a PCR approach to generate a new INGAP cDNA 
which excludes the 5' UTR region (nucleotides 1-16 in SEQ 
ID NO: 1) and nucleotides encoding the signal peptide. 

The sequence (SEQ ID NO: 1) that has been excluded is 
as follows: (the bolded area represents the sequence of the 40 
signal peptide) 

CTGCAAGACAGGTACCATG ATG CTT CCC ATG ACC CTC TGT 
MET MET Lea Pro MET The Lea Cys 
AGG ATG TCT TGG ATG CTG CTT TCC TGC CTG ATG TTC 
Arg MET Set Trp MET Leu Leo Ser Cys Leu MET Phe 
CTT TCT TGG GTG GAA GGT 
Leu Ser Trp Val GIu Gly 



To engineer the new INGAP construct we designed oHgo- 55 
nucleotides corresponding to the 5 1 and 3 f ends of the INGAP 
sequence to be amplified. 

EXAMPLE 7 

This example describes the use of polymerase chain 60 
reaction to synthesize INGMAT (construct which lacks the 
signal peptide sequence, i.e., which encodes the mature 
protein). 

Oligonucleotide design 65 

Oligonucleotides for PCR were designed to incorporate 
restriction enzyme recognition sites at their respective 5' 
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ends. The oligonucleotide designed for the 5' end of the gene 
incorporates a Bam HI site followed by 20 nucleotides 
corresponding to the N-terminus of the mature protein. The 
oligonucleotide designed for the 3' end incorporates an Xho 
5 I site followed by 20 untranslated nucleotides. The PCR 
product generated from these primers contains the mature 
INGAP sequence and the native protein termination codon. 

The following is the sequence of the oligonucleotides 
used: 

10 

5' of INGAP (SEQ ID NO: 2 ) 

5'-CCGCGGArCCCGAAGAATCTCAAAAGAAACT-3' 
3' of INGAP (SEQ ID NO: 3) 

5'- GACCGGCrCGAGTGCTCTTCCTGAGTGAATCC 



PCR of INGMAT 
Reaction conditions 

20 



Template: (50 ng INGAP original cDNA 5 //] 
removed from pCDNA3) 

Mgd 2 : 4 fjl 

10 X PCR buffer 5/*l 

25 dATP l/il 

dCTP 1 ^l 

dGTP 1^1 

dTTP 1 fd 

5' primer 1 fA 

y primer 1 /i\ 

30 H 2 0 29 fd 

Taq polymerase 1 fA 



total volume - 50 fA 



35 cycle parameters 

A) 2 min at 95° C 

B) 30 cycles of (1 min 95° C, 1 min 55° C, 1 min 72° 
C.) 

40 C) 7 min at 72° C 

D) 4° C. until removed from thermal cycler. 



55 The PCR products were then electrophoresed on a 5% 
PAGE in TBE. Ethidium bromide stained PCR products 
corresponding to the expected size for the construct were cut 
from the gel. The gel fragments were electro-eluted into 0.5 

6o ml of TBE, precipitated with 50 fd 3M sodium acetate and 
1 ml of isopropanol at -80° C. for 20 min, centrifuged, 
washed once with 1 ml of isopropanol, washed once with 1 
ml of 70% ethanol, and then dried under vacuum. The dried 
pellet was resuspended in 50 fd H 2 0 and quantified. At the 

65 end of this step the sequence of the PCR product that 
contains both restriction sites minus the signal sequence and 
5* UTR was as follows (SEQ ID NO: 4): 
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5-CC 


GCG 


GAT 


CCC 


GAA 


GAA 


TCT 


CAA 


AAG 


AAA 


CTGCCT 




Tcr 


TCA 


CGT 


ATA 


ACC 


TGT 


CCT 


CAA 


GGC 


TCT 


GTA 


GCC 


TAT 


GGG 


TCC 


TAT 


TGC 


TAT 


TCA 


CTG 


ATT 


TTG 


ATA 


CCA 


CAG 


ACC 


TGG 


TCT 


AAT 


GCA 


GAA 


CTA 


TCC 


TGC 


CAG 


ATG 


CAT 


rrc 


TCA 


GGA 


CAC 


CTG 


GCA 


TIT 


CTT 


CTC 


AGT 


ACT 


GGT 


GAA 


ATT 


ACC 


rrc 


GTG 


TCC 


TCC 


CTT 


GTG 


AAG 


AAC 


AGT 


TTG 


ACG 


GCC 


TAC 


CAG 


TAC 


ATC 


TGG 


ATT 


GGA 


CTC 


CAT 


GAT 


CCC 


TCA 


CAT 


GGT 


ACA 


CTA 


CCC 


AAC 


GGA 


AGT 


GGA 


TGG 


AGG 


TGG 


AGC 


AGT 




TCC 


AAT 


GTG 


CTG 


ACC 


TTC 


TAT 


AAC 


TGG 


GAG 


AGG 


AAC 


CCC 


TCT 


ATT 


GOT 


GOT 


GAC 


CGT 


GGT 


TAT 


TGT 


GCA 


GTT 


TTG 


TCT 


CAG 


AAA 


TCA 


GOT 


TTT 


CAG 


AAG 


TGG 


AGA 


GAT 


TTT 


AAT 


TGT 


GAA 


AAT 


GAG 


CTT 


CCC 


TAT 


ATC 


TGC 


AAA 


TTC 


AAG 


GTC 


TAG 


GGC 


AGT 


TCT 


AAT 


TTC 


AAC 


AGC 


TTG 


AAA 


ATA 


TTA 


TGA 


AGC 


TCA 


CAT 


GGA 


CAA 


GGA 


AGC 


AAG 


TAT 


GAG 


GAT 


TCA 


CTC 




AGG 


AAG 


AGC 


ACT 


CGA 


GCC 


GGT 


C-3 1 













^The boldcd areas represent the primers. 

EXAMPLE 8 

This example describes the creation of a plasmid contain- 
ing an expression construct for expression in eukaryotic 30 
systems. 

Restriction enzyme digestion of the 1NGMAT PCR 
product and the pEBVHis-B vector 

We performed two parallel restriction enzyme digestion 
reactions using 2.5 //g of both the INGMAT PCR product 
and pEBVHis-B vector. INGMAT was digested with Bam 
HI and Xho I simultaneously in a 30 fA volume. pEB VHis-B 
was digested with Bam HI and Xho I simultaneously in a 30 
fi\ volume. Both digestion reactions were carried out at 37° 
C. for a period of 4 hours. After the reactions were 
completed, 400 ng of each was electrophoresed on a 1.5% 
agarose gel and stained with ethidium bromide to assure 
complete digestion. The remainder (~2.1 fig) of both diges- 
tion reactions were passed over a sepharose G-50 column 
twice to remove the small DNA fragments followed by two 
equal volume phenol extractions. The extracted DNA was 
then precipitated with 2 volumes of ethanol and Vio volume 
3M sodium acetate at -80° C. for 20 minutes, centrifuged, 
washed twice with 70% ethanol and dried under vacuum. 
The pellets were resuspended in 25 Hfi and quantified. 

The pEB VHis-B expression system was purchased from 
INVITROGEN Corp. San Diego, Calil 

Ligation of INGMAT into pEB VHis-B 

INGMAT (Bam Hl/Xho I) and pEBVHis-B(Bam Hl/Xho 
I) have compatible ends suitable for ligation. 

Ligation conditions using a 20:1 insert to vector molar 60 
ratio. 



pEBVHis-B(vector) 62 ng 
INGMAT (insert) 80 ng 
10 X ligation buffer 
10 mM rATP 



40 



45 



50 



1/d 
4 M ] 

1/d 



nMA' n 7/ ranS / Ormali0n of Mammalian Cells by Plasmul 
DNA , Molecular and Cellular Biology, vol 7 T 
August 1987, p 2745-2752). ' 

kidney^StSf 40 tfaDSf0mied ^en monkcv 

Kidney cells) were grown on twenty 150 mm diameter ohirs 

conlency ^ ^ niciI,i ^ tre P«on.ycin) to ^0^- 

Each plate was washed twice with 10 mis of PFts an) r 
mis of fresh culture medium added TV ?a , ' , ■ 1 
mixture (2.5 ml) was added ^s^^^^ 
gently, and incubated overnight at 37° C. 

DNA transfection mixture: 



H 2 0 45 °-080 ml 

2.5 M CaCL I- 04 * mJ 

2XBES 0-125 ml 

125 mi 



Final volume - 2.5 ml 



»»*. of 5 p,,^ peltacd , .„ 

J^Z-^ EXAMPLE 9 

COy%£[ Ch ° f P EBVH ^INGMAT transfec.ed 
Ni+ 2 NTA agarose preparation 

discard the supernatant. Add 42 ml7Se S'* s " d * 
pend the resin. Centrifuge for 2 minutes at zmlr \ 
dtscard the supernatant. Add 42 mi o X wa^esu 
pend the res.n. Centrifuge for 2 minutes a, £g an i 

OOlMTris nH«m . MCL>aiMsodlum phosphaie, 
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of binding/lysis buffer A (6 M Guanidine HCL, 0.1M sodium 
phosphate, 0.01M Tris, pH 8.0) and resuspend the resin. 
Centrifuge for 2 minutes at 800xG and discard the super- 
natant. Bring the total volume up to 5 ml with buffer A. The 
5 slurry is now ready for the application of the transfecied 
COS-7 cell extract. 

Transfected COS-7 cells lysis and protein isolation 

io Resuspend the transfected cell pellets in 2.5 ml of lysis 
buffer A. (6M Guanidine HCL, 0.1M sodium phosphate, 
0.01M Tris, pH 8.0). Combine two resuspensions into one 
for a final volume of 5 ml. The lysed cells were passed 
through an 18-gauge needle 4 times, transferred to a 15 ml 

15 centrifuge tube, and centrifuged for 15 minutes at 10,OOOxG. 
The supernatant was collected and 5 ml of prepared Ni 2+ 
NTA was added. The mixture was rotated for 45 minutes. 
The slurry was loaded onto a 1.6 cm diameter column and 
allowed to flow through by gravity. 
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Washes 



Flow through 30 ml of buffer A. (No need to collect.) 
Flow through 30 ml of buffer B (8M Urea, 0.1M Sodium 

25 phosphate, 0.01M Tris, pH 8.0). ( No need to collect.) but 
^280 should be at or near zero before continuing, if not, then 
wash with more. Wash through 20 ml of buffer C, same as 
B but pH 6.3. Collect 3 ml fractions. Wash through 20 ml of 
buffer D, same as B but pH 5.9. Collect 3 ml fractions. 

30 Wash through 20 ml of buffer E, same as B but pH 4.5. 
Collect 3 ml fractions. At this point the protein should be in 
one of the fractions taken. Read the absorbance at 280 of all 
the fractions to discern where the protein is. Fractions 
containing the protein were pooled, concentrated, and ana- 

35 lyzed by Western blot to confirm identity of the protein. 

This example describes an analytical technique confirm- 
ing the identity of the product. 
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Biological activity of the expressed protein 



The ability of the expressed protein to stimulate cell 
proliferation was tested on ARIP cells. These cells exhibited 
a 50% increase in 3 H-TdR incorporation, at doses of ihe 
protein of 10-100 ng/ml. 

45 In summary, we have been ^atjle to express INGAP protein 
in an eukaryotic system b/excluding the 5'UTR and the 
signal peptide. The resultant protein is of the predicted 
molecular size of INGA^monomer and reacts with antibody 
to INGAP in a Western analysis. Hie protein shares with 
INGAP peptide the ability to induce ductal cell proliferation. 



SEQUENCE LISTING 



( 1 ) GENERAL INFORMATION: 

( i i i ) NUMBER OF SEQUENCES: 6 

( 2 ) INFORMATION FOR SEQ ID NO:l: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH; 94 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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-continued 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
CTCCAAGACA GCTACCATGA TGCTTCCCAT GACCCTCTGT AGGATGTCTT GGATGCTGCT 60 
TTCCTGCCTG ATGTTCCTTT CTTGGGTGGA AGGT 94 

( 2 ) INFORMATION FOR SEQ ID NO:£ 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 31 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDED NESS: single 
( D ) TOPOLOGY: linear 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

CCGCGGATCC CGAAGAATCT CAAAAGAAAC T 3 1 

( 2 ) INFORMATION FOR SEQ ID NO:3: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base pairs 
( B ) TYPE: nucleic acid 
•i— ( C ) STRANDEDNESS: single 

( D ) TOPOLOGY: linear 

~ ■ ( x i ) SEQUENCE DESCRIPTION: SEQ ID NO.3: 

\fl GACCGGCTCG AGTGCTCTTC CTGAGTGAAT CC 32 

~z ( 2 ) INFORMATION FOR SEQ ID NO:4: 

y_§ 

( i ) SEQUENCE CHARACTERISTICS: 
2 ( A ) LENGTH: 558 base pairs 

*~y ( B ) TYPE: nudeic acid 

( C ) STRANDEDNESS: single 
2 ( D ) TOPOLOGY: linear 

~ ( i i ) MOLECULE TYPE: cDNA 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

ffi CCGCGGATCC CGAAGAATCT CAAAAGAAAC TGCCTTCTTC ACGTATAACC TGTCCTCAAG 60 

P GCTCTGTAGC CTATGGGTCC TATTGCTATT CACTGATTTT GATACCACAG ACCTGGTCTA 120 

O ATGCAGAACT ATCCTGCCAG ATGCATTTCT CAGGACACCT GGCATTTCTT CTCAGTACTG 180 

GTGAAATTAC CTTCGTGTCC TCCCTTGTGA AGAACAGTTT GACGGCCTAC CAGTACATCT 240 

GGATTGGACT CCATGATCCC T C A CATGGT A CACTACCCAA CGGAAGTGGA TGGAAGTGGA 300 

GCAGTTCCAA TGTGCTGACC TTCTATAACT GGGAGAGGAA CCCCTCTATT GCTGCTGACC 360 

GTGGTTATTG TGCAGTTTTG TCTCAGAAAT CAGGTTTTCA GAAGTGGAGA GATTTTAATT 420 

GTG A A A A TG A GCTTCCCTAT ATCTGCAAAT T C AAGGTCT A GGGCAGTTCT AATTTCAACA 480 

GCTT G A A A A T ATT A TG A AG C TCACATGGAC AAGGAAGCAA GTATGAGGAT TCACTCAGGA 540 

AGAGCACTCG AGCCGGTC 558 

( 2 ) INFORMATION FOR SEQ ID NOA 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 26 amino acids 
( B ) TYPE: amino acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

Met Met Leo Pro Met Tbr Leo Cys Arg Met Ser Tip Met Leu Leu Ser 
I 5 10 15 



♦ I 
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Cys Leo Met Phe Lei Sei Trp 



2 0 



( 2 ) INFORMATION FOR SEQ ID N0:& 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 175 «nmo acids 
( B ) TYPE; amino add 
( C ) STRANDEDNESS: tingle 
( D ) TOPOLOGY: taesr 



V a I G 1 u G I y 
2 5 



( i i ) MOLECULE TYPE: protein 
(x i) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Pro Mel Tbr Lea Cy. Arg Mel S«T Trp Met Leo Leu Ser 




Cys Leu Met Phe L 

2 0 




e. Ser Trp Val Glo G fy\ G I o Glu Scr Gin Lys Lys 



Leu Pro Ser Ser Ai( 



Me Thi C y 



s Pro Gin Oly Ser Val Ala Tyr Gly 
4 5 



S«. T,, C,. T,r S«. L«. II. L«. II. P.. 01. Tbr T.p S«. A.. A I 



5 0 



Glu Leu Ser Cys GH Met His 



6 5 

Scr Thr Gly G I 



7 0 



Phe Ser Gly Hi* Leu Ala Phe Leu Leu 

75 80 



Me Thr Phe Val Ser Ser Leu Val Lys Asn Ser Leu 
85 90 95 



Tyr Gin Tji Me Trp 
1 0 0 



lie Gly Leu His Asp Pro Ser His Gly 
105 HO 



Thr Leu Pro Asn Gly Ser 
1 1 5 



Gly Trp Lys Trp Ser Scr Scr Asn Vat Leu 
120 12 5 



Thr Phe Tyr Asd Trp Glu Arg Asn 

130 135 



Pro Ser Me Ala Ala Asp Arg Gly 
14 0 



Tyr Cys 
1 4 5 




Al. Val Leu Scr Gl. Lys Scr Gly Phe Gin Lys Trp Arg Asp 

, Asn Cys Glu A.. CI. Leu Pro Tyr lie Cy. Lys Phe Lys V a l/ 
16 5 

We claim: . . 45 

I At A recombinant construct for expression of Islet Neo- * - 
genesis Associated Protein or INGAP activity comprising: 
a first nucleotide sequence encoding amino 'wads 27 to 
175 as shown in SEQ ID NO: 6 operably linked to -a 
transcriptional initiation site and a translation^ initia- 
tion site, wherein a second nucleotide sequence encod- 50 
ing a signal peptide is not present immediately 5 ot 
said first nucleotide seqwaweTJ 
2 The construct of claim 1 wherein nucleotides l-l°ot 
SEQ ID NO: 1 are not present 5* of said first nucleotide 

^^The construct of claim 1 further comprising a third 55 
nucleotide sequence encoding a histidine tag. 

4 The construct of claim 3 wherein the third nucleotide 
sequence is immediately 5' or 3' to said first nucleotide 
sequence. of daim j WQereul tnc transcriptional 60 

initiation site is inducible. . . 

nTrbe construct of claim 1 wherein Ac transcriptional 
initiation site is the lac promoter/operator^ 

7 fine construct of claim 1 further comprising a promoter 
sequence capable of initialing constitutive transcription. 

8 The construct of claim 7 wherein the promoter 
sequence is Rous sarcoma virus long terminal repeat 

(RSVLTRH * — 
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-continued 




9. The construct of claim 1 further comprising a nucle- 
otide sequence encoding a nuclear antigen. 

,0. The construct of claim S^wherein the nuclear antigen 
is HBNA-p +' 

11. The^construct of claim 1 further comprising an origin 
of replication. 

50 12. The construct of claim 11 wherein the origin of 
replication is Epstein Bar Virus (EBV) origin of replication. 

A method of producing biologically active fclet Neo- 
genesis Associated Protein or INGAP protein from a recom- 
binant host cell comprising the steps of: 
culturing a host cell comprising a recombinant construct 
comprising a first nucleotide sequence encoding amino 
acids 27 to 175 as shown in SEQ ID NO: 6 operably 
linked to a transriptional initiation site and a transla- 
tional initiation site, wherein a second nucleotide 
sequence encoding a signal peptide is not present 
immediately 5' of said first nucleotide sequence, and 
recovering protein from said cultured host cell 

14. The method of claim 13 wherein the construct further 
comprises a third nucleotide sequence encoding a histidine 
tag, and INGAP protein is purified using a nickel affinity 
matrix. 

15. A host cell comprising a recombinant construct com- 
prising a first nucleotide sequence encoding amino acids 27 



55 



60 



65 



to 175 as shown i n SEQ ID N O; 6 opcrablv li nked to a |ni+icA-»gn 



^ transcriptional [Iroijsite and a uanslational initiation site. 

^ wherein.a second niirlenhHpy^nnence encoding fi signal 

peptide^ not present imme^atel^5-«pf said first nucleotide 
.s|g,uen^\ 5 

U6, The construct of claim 1 wherein the first nucleotide 
sequence encoding amino acids 27 to 175 comprises nucle- 



otides 12-^56 of SEQ ID NO: 4. 
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17. The method of claim 13 wherein the first nucleotide 
sequence encoding amino acids 27-175 comprises nucle- 
otides 12-456 of SEQ ID NO: 4. 

18. The host cell of claim IS wherein the first nucleotide 
sequence encoding amino acid>27-175 comprises nucle- 
otides 12^56 of SEQ ID NO' 4> 



